AITopics | grammatically correct

Collaborating Authors

grammatically correct

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Reviews: A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization

Neural Information Processing SystemsOct-8-2024, 08:12:46 GMT

This paper focuses on the optimization problem min f(x) h(x), where f is of a finite sum structure (with n functions in the sum), with nonconvex but smooth components, and h is a convex but possibly nonsmooth function. So, this is a nonconvex finite sum problem with a convex regularizer. Function h is treated using a prox step. The authors propose a small modification to ProxSVRG (called ProxSVRG), and prove that this small modification has surprisingly interesting consequences. The modification consists in replacing the full gradient computation in the outer loop of ProxSVRG by an approximation thereof through subsampling/minibatch (batch size B).

artificial intelligence, machine learning, proxsvrg, (14 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.42)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.42)

Add feedback

Rethinking Semantic Parsing for Large Language Models: Enhancing LLM Performance with Semantic Hints

An, Kaikai, Si, Shuzheng, Hu, Helan, Zhao, Haozhe, Wang, Yuchi, Guo, Qingyan, Chang, Baobao

arXiv.org Artificial IntelligenceSep-22-2024

Semantic Parsing aims to capture the meaning of a sentence and convert it into a logical, structured form. Previous studies show that semantic parsing enhances the performance of smaller models (e.g., BERT) on downstream tasks. However, it remains unclear whether the improvements extend similarly to LLMs. In this paper, our empirical findings reveal that, unlike smaller models, directly adding semantic parsing results into LLMs reduces their performance. To overcome this, we propose SENSE, a novel prompting approach that embeds semantic hints within the prompt. Experiments show that SENSE consistently improves LLMs' performance across various tasks, highlighting the potential of integrating semantic information to improve LLM capabilities.

computational linguistic, entail, structure and semantic, (13 more...)

arXiv.org Artificial Intelligence

2409.14469

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Singapore (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.98)

Add feedback

Language models align with human judgments on key grammatical constructions

Hu, Jennifer, Mahowald, Kyle, Lupyan, Gary, Ivanova, Anna, Levy, Roger

arXiv.org Artificial IntelligenceJan-19-2024

Do Large Language Models (LLMs) make human-like linguistic generalizations? Dentella et al. (5) (DGL) prompt several LLMs ("Is the following sentence grammatically correct in English?") to elicit grammaticality judgments of 80 English sentences, concluding that LLMs demonstrate a "yes-response bias" and a "failure to distinguish grammatical from ungrammatical sentences". We re-evaluate LLM performance using well-established practices and find that DGL's data in fact provide evidence for just how well LLMs capture human linguistic judgments. Children learn to produce well-formed sentences without necessarily being able to articulate the underlying grammatical rules, a distinction long noted in linguistics (e.g., 1; 6; 3). DGL blur this distinction: their task requires not just grammatical competence, but also knowing what "grammatically correct" means.

judgment, minimal pair, ungram, (13 more...)

arXiv.org Artificial Intelligence

2402.01676

Country:

North America > United States > Wisconsin > Dane County > Madison (0.05)
North America > United States > Texas > Travis County > Austin (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
(3 more...)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

FiLM: Fill-in Language Models for Any-Order Generation

Shen, Tianxiao, Peng, Hao, Shen, Ruoqi, Fu, Yao, Harchaoui, Zaid, Choi, Yejin

arXiv.org Artificial IntelligenceOct-15-2023

Language models have become the backbone of today's AI systems. However, their predominant left-to-right generation limits the use of bidirectional context, which is essential for tasks that involve filling text in the middle. We propose the Fill-in Language Model (FiLM), a new language modeling approach that allows for flexible generation at any position without adhering to a specific generation order. Its training extends the masked language modeling objective by adopting varying mask probabilities sampled from the Beta distribution to enhance the generative capabilities of FiLM. During inference, FiLM can seamlessly insert missing phrases, sentences, or paragraphs, ensuring that the outputs are fluent and are coherent with the surrounding context. In both automatic and human evaluations, FiLM outperforms existing infilling methods that rely on left-to-right language models trained on rearranged text segments. FiLM is easy to implement and can be either trained from scratch or fine-tuned from a left-to-right language model. Notably, as the model size grows, FiLM's perplexity approaches that of strong left-to-right language models of similar sizes, indicating FiLM's scalability and potential as a large language model.

arxiv preprint arxiv, language model, perplexity, (13 more...)

arXiv.org Artificial Intelligence

2310.0993

Country:

Europe > United Kingdom (0.28)
Asia > China (0.14)
Europe > Switzerland (0.04)
(13 more...)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment (1.00)
Media > Music (0.94)
Government > Regional Government (0.93)
Government > Military (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.93)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.73)

Add feedback

Testing AI performance on less frequent aspects of language reveals insensitivity to underlying meaning

Dentella, Vittoria, Murphy, Elliot, Marcus, Gary, Leivada, Evelina

arXiv.org Artificial IntelligenceFeb-27-2023

Advances in computational methods and big data availability have recently translated into breakthroughs in AI applications. With successes in bottom-up challenges partially overshadowing shortcomings, the 'human-like' performance of Large Language Models has raised the question of how linguistic performance is achieved by algorithms. Given systematic shortcomings in generalization across many AI systems, in this work we ask whether linguistic performance is indeed guided by language knowledge in Large Language Models. To this end, we prompt GPT-3 with a grammaticality judgement task and comprehension questions on less frequent constructions that are thus unlikely to form part of Large Language Models' training data. These included grammatical 'illusions', semantic anomalies, complex nested hierarchies and self-embeddings. GPT-3 failed for every prompt but one, often offering answers that show a critical lack of understanding even of high-frequency words used in these less frequent grammatical constructions. The present work sheds light on the boundaries of the alleged AI human-like linguistic competence and argues that, far from human-like, the next-word prediction abilities of LLMs may face issues of robustness, when pushed beyond training data.

grammatically correct, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2302.12313

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
North America > United States > California > Los Angeles County > Los Angeles (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(20 more...)

Genre: Research Report > New Finding (0.68)

Industry: Transportation > Air (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AI Takes Over Ad Creativity

#artificialintelligenceAug-31-2020, 04:36:07 GMT

Paid performance marketing is a true art, and one has to be experienced enough to make the right assumptions from the start. Usually, no one wants to spend too much money on experimenting, without an acceptable return on this investment in the end. If the first assumptions were wrong or keeping low-performance rates, a human marketer has to be able to optimize the ad at a very quick pace. There are already many places where AI can be applied in online marketing, and specifically in the paid performance marketing (PPC) field. The life cycle of a paid ad, or the "ad journey", i.e an ad that was created with the aim to be published on Google, Facebook, Instagram, LinkedIn, Twitter, Pinterest or any other platforms that provide paid targeted advertisement as a service, can be described as the following: Ad creation -- where the ideation process happens in a human's head (business owner or a marketer).

artificial intelligence, platform, social media, (18 more...)

#artificialintelligence

Industry: Marketing (0.73)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Finovate 2017 AI Recap: Artificial Intelligence disrupts Fintech with Impressive Force

#artificialintelligenceSep-22-2017, 20:55:37 GMT

This September, financial institutions, venture capitalists, well established businesses, and startups alike joined together in New York City for 4 days of demos, panels, keynotes and roundtable discussions navigating the financial technology landscape. Of the 70 companies that demoed, AI dominated the discussion; with over 15% of companies insisting that "AI" is the driving force behind their tech. As the demos continued, it became increasingly clear that the disparities between those AIs are immense, and the extent to which their functionalities vary should not be overlooked. While Finovate 2016 outlined banks' need to implement AI into their platforms, Finovate 2017 identified several key factors that matter most when choosing a virtual banking assistant. In financial services, an industry overwhelmingly saturated with competition, customer service is the one true differentiator.

artificial intelligence, chatbot, natural language, (17 more...)

#artificialintelligence

Country: North America > United States > New York (0.25)

Industry: Banking & Finance > Financial Services (0.36)

Technology:

Information Technology > e-Commerce > Financial Technology (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.52)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.51)

Add feedback